FORM 1 (cDNA sequence provided) : 

1 ATGGACAGAG TTTATGAAAT TCCTGAGGAG CCAAATGTGG ATCCGGTTTC 
51 ATCTCTGGAG GAAGATGTCA TCCGTGGAGC CAACCCCCGA TTTACTTTTC 
101 CATTTAGCAT CCTTTTCTCC ACCTTTTTGT ACTGTGGGGA GGCTGCATCT 
151 GCTTTGTACA TGGTTAGAAT CTATCGAAAG AATAGTGAAA CTTACCGGAT 
201 GACATACACC TTTTCTTTCT TTATGTTTTC ATCCATTATG GTCCAGTTGA 
251 CCCTCATTTT TGTCCACAGA GATCTAGCCA AAGATAAACC GCTATCATTA 
301 TTTATGCATC TAATCCTCTT GGGACCTGTT ATCAGATGTT TGGAGGCCAT 
351 GATTAAGTAC CTCACACTGT GGAAGAAAGA GGAGCAGGAG GAGCCCTATG 
4 01 TCAGCCTCAC CCGAAAGAAG ATGCTAATAG ATGGCGAGGA GGTGCTGATA 
451 GAATGGGAGG TGGGCCACTC CATCCGGACC CTGGCTATGC ACCGCAATGC 
501 CTACAAACGT ATGTCACAGA TCCAAGCCTT CCTGGGCTCA GTGCCCCAGC 
551 T G AC CT AT C A GCTCTATGTG AGCCTGATCT CTGCAGAGGT TCCCCTGGGT 
601 AGAGTTGTGC TAATGGTATT TTCCCTGGTA TCTGTCACCT ATGGGGCCAC 
651 CCTTTGCAAT ATGTTGGCTA TCCAGATCAA GTACGAT GAC TACAAGATTC 
7 01 GCCTTGGGCC ACT AG AAGT C CTCTGCATCA CCATCTGGCG GACATTGGAG 
751 ATCACTTCCC GCCTCCTGAT TCTGGTGCTC TTCTCAGCCA CTTTGAAATT 
801 GAAGGCTGTG CCCTTCCTAG TGCTCAACTT CCTGATCATC CTCTTTGAGC 
851 CCTGGATTAA GTTCTGGAGA AGTGGTGCCC AGATGCCCAA TAACATTGAG 
901 AAAAACTTCA GCCGGGTCGG CACTCTGGTG GTCCTGATTT CAGTCACCAT 
951 CCTCTATGCT GGCATCAACT TCTCTTGCTG GTCAGCTTTG CAGTTGAGGT 
1001 TGGCAGACAG AGATCTCGTC GACAAAGGGC AGAACTGGGG ACATATGGGC 
1051 CTGCACTATA GTGTGAGGTT GGTAGAGAAT GTGATCATGG TCTTGGTTTT 
1101 TAAGTTCTTT GGAGTGAAAG TGTTACTGAA TTACTGTCAT TCCTTGATTG 
1151 CCTTGCAGCT CAT TAT T G CT TATCTGATTT CCATTGACTT CATGCTCCTT 
1201 TTCTTCCAGT ACTTGCATCC ATTGCGCTCA CTCTTCACCC ATAATGTAGT 
1251 AGACTACCTC CATTGTGTCT GCTGTCACCA GCACCCTCGG ACCAGGGTTG 
1301 AGAACTCAGA GCCACCCTTT GAGACTGAAG CAAGGCAAAG TGTTGTCTGA 

FEATURES: 

Start Codon: 1 
Stop Codon: 134 8 
3 f UTR: 1351 

FORM 2 (transcript sequence provided) : 

1 ATGAACACAA GACCACAACA TTCAGAAAGA ACCTCGACAA TGGACAGAGT 

51 TTATGAAATT CCTGAGGAGC CAAATGTGGA TCCGGTTTCA TCTCTGGAGG 

101 AAGATGTCAT CCGTGGAGCC AACCCCCGAT TTACTTTTCC ATTTAGCATC 

151 CTTTTCTCCA CCTTTTTGTA CTGTGGGGAG GCTGCATCTG CTTTGTACAT 

201 GGTTAGAATC TATCGAAAGA ATAGTGAAAC TTACTGGATG ACATACACCT 

251 TTTCTTTCTT TATGTTTTCA TCCATTATGG TCCAGTTGAC CCTCATTTTT 

301 GTCCACAGAG AT C TAG C C AA AGATAAACCG CT AT CAT TAT TTATGCATCT 

351 AATCCTCTTG GGACCTGTTA TCAGATGTTT GGAGGCCATG ATTAAGTACC 

4 01 TCACACTGTG GAAGAAAGAG GAGCAGGAGG AGCCCTATGT CAGCCTCACC 

4 51 CGAAAGAAGA TGCTAATAGA TGGCGAGGAG GTGCTGATAG AATGGGAGGT 

501 GGGCCACTCC ATCCGGACCC TGGCTATGCA CCGCAATGCC TACAAACGTA 

551 TGTCACAGAT CCAAGCCTTC CTGGGCTCAG TGCCCCAGCT GACCTATCAG 

601 CTCTATGTGA GCCTGATCTC T G C AG AG GT T CCCCTGGGTA GAGTTGTGCT 

651 AATGGTATTT TCCCTGGTAT CTGTCACCTA TGGGGCCACC CTTTGCAATA 

701 TGTTGGCTAT CCAGATCAAG TACGATGACT ACAAGATTCG CCTTGGGCCA 

7 51 CTAGAAGTCC TCTGCATCAC CATCTGGCGG ACATTGGAGA TCACTTCCCG 

801 CCTCCTGATT CTGGTGCTCT TCTCAGCCAC TTTGAAATTG AAGGCTGTGC 

851 CCTTCCTAGT GCTCAACTTC CTGATCATCC TCTTTGAGCC CTGGATTAAG 

901 TTCTGGAGAA GTGGTGCCCA GATGCCCAAT AACATTGAGA AAAACTT CAG 

951 CCGGGTCGGC ACTCTGGTGG TCCTGATTTC AG T C AC CAT C CTCTATGCTG 

1001 GCATCAACTT CTCTTGCTGG TCAGCTTTGC AGTTGAGGTT GGCAGACAGA 

1051 GATCTCGTCG ACAAAGGGCA GAACTGGGGA CATATGGGCC TGCACTATAG 

1101 TGTGAGGTTG GTAGAGAATG TGATCATGGT CTTGGTTTTT AAGTTCTTTG 

1151 GAGTGAAAGT GTTACTGAAT TACT GT CAT T CCTTGATTGC CTTGCAGCTC 

1201 ATTATTGCTT ATCTGATTTC CATTGGCTTC ATGCTCCTTT TCTTCCAGTA 

1251 CTTGCATCCA TTGCGCTCAC TCTTCACCCA TAATGTAGTA GACTACCTCC 

1301 ATTGTGTCTG CTGTCACCAG CACCCTCGGA CCAGGGTTGA GAACTCAGAG 

1351 CCACCCTTTG AGACTGAAGC AAGGCAAAGT GTTGTCTGA 
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HOMOLOGOUS PROTEINS: 

Top BLAST Hits: 

gi| 6502963 |gb|AAFl4527.1|AFl55511_l (AF155511) KX antigen [Mus . 

gi | 10835267 | ref |NP_066569. 1| Keil blood group precursor (McLeod. 
gi(2135606ipir[|I39294 McLeod syndrome-associated protein XK - . 
gi| 3183551 |sp|P51811iXK_HUMAN MEMBRANE TRANSPORT PROTEIN XK (KX. 
gi | 47593301 ref |NP_004668. 1| Testis-specif ic XK-related protein . 
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BLAST to dbEST: 

gi 11891549 /dataset=dbest /taxon-9606 . 

EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 

Expression information from BLAST dbEST hits: 
gi 1 1891549 Germinal center B cells 

Expression information from PCR-based tissue screening panels: 
Mixed tissue 



Score E 
383 e-104 
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FORM 1: 



1 MDRVYEIPEE PNVDPVSSLE EDVIRGANPR FTFPFSILFS TFLYCGEAAS 
51 ALYMVRIYRK NSETYRMTYT FSFFMFSSIM VQLTLIFVHR DLAKDKPLSL 
101 FMHLILLGPV IRCLEAMIKY LTLWKKEEQE EPYVSLTRKK MLIDGEEVLI 
151 EWEVGHSIRT LAMHRNAYKR MSQIQAFLGS VPQLTYQLYV SLISAEVPLG 
201 RVVLMVFSLV SVTYGATLCN MLAIQIKYDD YKIRLGPLEV LCITIWRTLE 
251 ITSRLLILVL FSATLKLKAV PFLVLNFLII LFEPWIKFWR SGAQMPNNIE 
301 KNFSRVGTLV VLISVTILYA GINFSCWSAL QLRLADRDLV DKGQNWGHMG 
351 LHYSVRLVEN VIMVLVFKFF GVKVLLNYCH SLIALQLIIA YLISIDFMLL 
401 FFQYLHPLRS LFTHNWDYL HCVCCHQHPR TRVENSEPPF ETEARQSW 



1 MNTRPQHSER TSTMDRVYEI PEEPNVDPVS SLEEDVIRGA NPRFTFPFSI 

51 LFSTFLYCGE AASALYMVRI YRKNSETYWM TYTFSFFMFS SIMVQLTLIF 

101 VHRDLAKDKP LSLFMHLILL GPVIRCLEAM IKYLTLWKKE EQEEPYVSLT 

151 RKKMLIDGEE VLIEWEVGHS IRTLAMHRNA YKRMSQIQAF LGSVPQLTYQ 

201 LYVSLISAEV PLGRWLMVF SLVSVTYGAT LCNMLAIQIK YDDYKIRLGP 

251 LEVLCITIWR TLEITSRLLI LVLFSATLKL KAVPFLVLNF LIILFEPWIK 

301 FWRSGAQMPN NIEKNFSRVG TLVVLISVTI LYAGINFSCW SALQLRLADR 

351 DLVDKGQNWG HMGLHYSVRL VENVIMVLVF KFFGVKVLLN YCHSLIALQL 

4 01 IIAYLISIGF MLLFFQYLHP LRSLFTHNW DYLHCVCCHQ HPRTRVENSE 

4 51 PPFETEARQS W 



FEATURES: 

Functional domains and key regions: 

[1] PDOC00001 PS00001 ASN_GLYCOSYLATION 
N-glycosylation site 

Number of matches: 2 

1 302-305 NFSR 

2 323-326 NFSC 

[2] PDOC00004 PS00004 CAMP_PHOSPHO_SITE 

cAMP- and cGMP-dependent protein kinase phosphorylation site 

Number of matches: 2 

1 59-62 RKNS 

2 169-172 KRMS 

[3] PDOC00005 PS00005 PKC__PHOSPHO_SITE 
Protein kinase C phosphorylation site 

Number of matches: 6 

1 64-66 TYR 

2 137-139 TRK 

3 157-159 SIR 

4 252-254 TSR 

5 264-266 TLK 

6 354-356 SVR 

[4] PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 

Number of matches: 3 



FORM 2: 



1 
2 
3 



17- 20 SSLE 

18- 21 SLEE 
431-434 TRVE 
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[5] PDOC00007 PS00007 TYR_PHOSPHO_SITE 
Tyrosine kinase phosphorylation site 

12 6-133 KEEQEEPY 

[6] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Number of matches: 4 

1 215-220 GATLCN 

2 321-326 GINFSC 

3 343-348 GQNWGH 

4 350-355 GLHYSV 

[7] PDOC00029 PS00029 LEUCINE_ZIPPER 
Leucine zipper pattern 

100-121 LFMHLILLGPVIRCLEAMIKYL 



Membrane spanning structure and domains: 
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BLAST Alignment to Top Hit: 

>gi| 6502963 [gb|AAF14527.1SAF155511_l (AF155511) KX antigen [Mus 
musculus] Length = 44 6 

Score = 366 bits (930), Expect = e-106 

Identities = 179/411 (43%), Positives = 265/411 (63%), Gaps = 11/411 (2%) 

Query: 33 FPFSILFSTFLYCGEAASALYMVRIYRKNSETYRMTYTFSFFMFSSIMVQLTLIFVHRDL 92 

FP S++ S FL+ E A+ALY4- YR + T F + +VQ TL+FVHRDL 

Sbjct: 3 FPASVIASVFLFVAETAAALYLSSTYRSAGDRMWQVLTLLFSLMPCALVQFTLLFVHRDL 62 

Query: 93 AKDKPLSLFMHLILLGPVIRCLEAMIKYLTLWKKEEQEEPYVSLTRKECMLI-DGEEVLIE 151 

++D+PL+L MHL4- LGP+ RC E Y + ++ EEPYVS+T+K+ + DG +E 
Sbjct: 63 SRDRPLALLMHLLQLGPLYRCCEVFCIYC— QSDQNEEPYVSITKKRQMPKDGLSEEVE 119 

Query: 152 WEVGHSIRTLAMHRNAYKRMSQIQAFLGSVPQLTYQLYVSLISAEVPLGRWLMVFSLVS 211 

EVG + L HR+A+ R S IQAFLGS PQLT QLY++++ + GR +M SL+S 
Sbjct: 120 KEVGQAEGKLITHRSAFSRASVIQAFLGSAPQLTLQLYITVLEQNITTGRCFIMTLSLLS 179 

Query: 212 VTYGATLCNMLAIQIKYDDYKIRLGPLEVLCITIWRTLEITSRLLILVLFSATLKLKAVP 271 

+ YGA CN+LAI+IKYD+Y++++ PL +CI +WR+ EI +R+++LVLF++ LK+ V 
Sbjct: 180 IVYGALRCNILAIKIKYDEYEVKVKPLAYVCIFLWRSFEIATRVIVLVLFTSVLKIWVVA 239 

Query: 272 FLVLNFLIILFEPWIKFWRSGAQMPNNIEKNFSRVGTLWLISVTILYAGINFSCWSALQ 331 

+++NF PWI FW SG+ P NIEK SRVGT +VL +T+LYAGIN CWSA+Q 

Sbjct: 240 VILWFFSFFLYPWIVFWCSGSPFPENIEKALSRVGTTIVLCFLTLLYAGINMFCWSAVQ 299 

Query: 332 LRLADRDLVDKGQNWGHMGLHYSVRLVENVIMVLVFKFFGVKVLLNYCHSLIALQLIIAY 391 

L++ + +L+ K QNW + ++Y R +EN FF + + C L+ LQL+I Y 

Sbjct: 300 LKIDNPELISKSQNWYRLLIYYMTRFIENSVLLLLWYFFKTDIYMYVCAPLLILQLLIGY 359 

Query: 392 LISIDFMLLFFQYLHPLRSLFTHNWD YLHCVCCHQHPRTRVENSEP 438 

I FML+F+Q+ HP + LF+ +V + L C C R ++SEP 

Sbjct: 360 CTGILFMLVFYQFFHPCKKLFSSSVSESFRALLRCACWSS LRRKSSEP 4 07 



ALIGNMENT OF FORM 1 AND FORM 2: 

>FORM 2 

Length = 462 (Length of FORM 1 = 449) 
Score - 900 bits (2301), Expect = 0.0 
Identities = 447/449 (99%), Positives = 447/449 (99%) 



FORM 


1: 


1 


MDRVYEIPEEPNVDPVSSLEEDVIRGANPRFTFPFSILFSTFLYCGEAASALYMVRIYRK 


60 








MDRVYEIPEEPNVDPVSSLEEDVIRGANPRFTFPFSILFSTFLYCGEAASALYMVRIYRK 




FORM 


2: 


14 


MDRVYEIPEEPNVDPVSSLEEDVIRGANPRFTFPFSILFSTFLYCGEAASALYMVRIYRK 


73 


FORM 


1: 


61 


NSETYRMTYTFSFFMFSSIMVQLTLIFVHRDLAKDKPLSLFMHLILLGPVIRCLEAMIKY 


120 








NSETY MTYTFSFFMFSSIMVQLTLIFVHRDLAKDKPLSLFMHLILLGPVIRCLEAMIKY 




FORM 


2: 


74 


NSETYWMTYTFSFFMFSSIMVQLTLIFVHRDLAKDKPLSLFMHLILLGPVIRCLEAMIKY 


133 


FORM 


1: 


121 


LTLWKKEEQEEPYVSLTRKKMLIDGEEVLIEWEVGHSIRTLAMHRNAYKRMSQIQAFLGS 


180 








LTLWKKEEQEE P YV S LTRKKML I DGEE VL IEWE VGHS I RTLAMHRN AYKRMSQ I QAFLGS 




FORM 


2: 


134 LTLWKKEEQEEPYVSLTRKKMLIDGEEVLIEWEVGHSIRTLAMHRNAYKRMSQIQAFLGS 


193 


FORM 


1: 


181 VPQLTYQLYVSLISAEVPLGRWLMVFSLVSVTYGATLCNMLAIQIKYDDYKIRLGPLEV 


240 








VPQLTYQLYVSLISAEVPLGRWLMVFSLVSVTYGATLCNMLAIQIKYDDYKIRLGPLEV 




FORM 


2: 


194 


VPQLTYQLYVSLISAEVPLGRVVLMVFSLVSVTYGATLCNMLAIQIKYDDYKIRLGPLEV 


253 


FORM 


1: 


241 


LCITIWRTLEITSRLLILVLFSATLKLKAVPFLVLNFLIILFEPWIKFWRSGAQMPNNIE 


300 








LCITIWRTLEITSRLLILVLFSATLKLKAVPFLVLNFLIILFEPWIKFWRSGAQMPNNIE 




FORM 


2: 


254 


LCITIWRTLEITSRLLILVLFSATLKLKAVPFLVLNFLIILFEPWIKFWRSGAQMPNNIE 


313 


FORM 


1: 


301 


KNFSRVGTLWLISVTILYAGINFSCWSALQLRLADRDLVDKGQNWGHMGLHYSVRLVEN 


360 



KNFSRVGTLWLISVTILYAGINFSCWSALQLRLADRDLVDKGQNWGHMGLHYSVRLVEN 
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FORM 2: 314 KNFSRVGTLVVLISVTILYAGINFSCWSALQLRLADRDLVDKGQNWGHMGLHYSVRLVEN 373 

FORM 1: 361 VIMVLVFKFFGVKVLLNYCHSLIALQLIIAYLISIDFMLLFFQYLHPLRSLFTHNWDYL 420 

VIMVLVFKFFGVKVLLNYCHSLIALQLIIAYLISI FMLLFFQYLHPLRSLFTHNWDYL 
FORM 2: 374 VIMVLVFKFFGVKVLLNYCHSLIALQLIIAYLISIGFMLLFFQYLHPLRSLFTHNWDYL 433 

FORM 1: 421 HCVCCHQHPRTRVENSEPPFETEARQSW 449 

HCVCCHQHPRTRVENSEPPFETEARQSW 
FORM 2: 434 HCVCCHQHPRTRVENSEPPFETEARQSW 462 



Hmmer search results (Pfam) : 

Model Description Score E-value N 

CE00306 E00306 Membrane_transport_protein_XK 390.8 1.3e-113 1 

Parsed for domains: 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 

CE00306 1/1 31 416 1 384 [. 390.8 1.3e-113 



FIGURE 2, page 4 of 4 



1 TAT TAT TAT T ATTATTAAGA CGTAATCTTG CTCTGTTGCC CAGGCTGGAG 
51 TGCAGTGGCG TGATCTCAGC TCACTGCAAC CTCTGCCGTC CGGGTTCAAG 
101 TTTTTCTCCT GCCTCAGCCT CCTGAGTAGC TGGGATTACA GTCACGCACC 
151 ACCACGACCA GCTGATTTTT GTATTTTTAG TAG AG AT G GG GTTTCACCAC 
201 GTTGGCCAGG CTGGTTTCGA ACTCCTGACC TCAAGTGATC TGCCTGCCTC 
251 AGCCTCCCAA AGTGCTGGGA TTACAGGCGT GAACCACTGT GCCTGGCCTT 
301 CAT CT AT ATT ATTACCAGGA GGCAGATGTG TTCTCTTTTT CTCTGAGGTT 
351 TAGAATTATG CAAATGAAGA TATGAAAACA AAAGCTCAGT GAGGTGGGGA 
401 GGATTACACT TAAGAATACA GGTAATTTTC AAAGCTCTTT AAGACACCCC 
451 TCTCAGTTTT TACTAACAGC TCTCTCTTGG CTCTTTGCCA GTCTGTTTAG 
501 AATTTGGCAC CTCTTCATAA CCTTTCAACC AAAGACCTGT AAGTTCATTC 
551 TAAAGCTCCT ATCCTGGCCT CATTTTGCAA GTGGAGAAAT CAAGGCATAA 
601 AATATGAGCT TTCAGTGTCT GTGGGCTGAC CTTGAGTCTT GACCTTTATC 
651 CTGTTCTATC TTCCCTCCGC CGAAAACTCT GACCCTATTC CTCCCAGGTT 
7 01 CCCCCTTCAT GAT AT TAT CT GGAGGGCAAT AGGACCTAGG GAGGTTCCAC 
751 CCTGCGGCGG AGGGAGACAC ACCTGCCTAA CAGCGTGGGT AGAGTGAGTG 
801 TTGAAGCAAG TCACTTAACT AGTTAGGGAG GGCGGGGTAG AAGTGGGGGC 
851 CTGCTGCTCC TAGGGAGGAG TAAAGCTGTG GCTCCTGCCT GGGTCTGGAG 
901 GTGGTGGTCA GAAGTGCTTC TGAAGAGCGG CCCAAGCCCC TTTTTGTCCC 
951 GCCACTCCAC AACGAG CAT C CCTCGGCTGG CCGCCTGCCC GGGAACTCTC 
1001 CGGCTGGTTT TGTTTGGCCG CAGCCGTCCC GCCCATCTCG CCCGCCCCCG 
1051 CCGTCCCGGT GCCTTAGTTT TTGAAGCTGC CGACCTCTCG CAGCTGGAAT 
1101 CGCAGACCAG GCAGGACCCT G G C AG C AG AC GGCGTCCAAG AGTTTGGCGA 
1151 CCTCCGTCCA GCCAGGTTGG CGCCCCGCAC ATCGTGCCTC T C AC TAG C AA 
1201 AGTTTCTCCG AGGAGAAGCA GCCCCTCCAG CCTTTTCTTC ATCCTGTAGA 
1251 GCGAGCGCGC TCTGCTTCTG TCCCTCAACA CTGCATTCGG AGACAGGGTG 
1301 GTGACAATAC TCCACTCCCG GGCCAGGCGG TCTTGGGGGC GGGGCTTGGG 
1351 GGAATCCGAG GAGCTATCCT GAGAACCCTG GACTCGGCAA AGGTCCTGAG 
14 01 AGCGCGCAGG TGAGCGGGCC AGCTGATAGC TACAGCCTAG CAATAGCTAG 
14 51 GATACCTAGG CACTGAACTG AATCCCCTCT TCTGCCCTCC TTCTTCTGCG 
1501 CCCGCTCTTC TGCCCTGGCT CAGCTCTCCG CTGACTTGAG AGGACACACT 
1551 GGTCAGGACT CTTTGTGAGG AGCTGCTGAG TGTCGGTGCC CCCGACAGAT 
1601 CGGCTACACC CTGCCTGAGG GGCTGCGAAA GGAGCCGCCA CGGAAGCCGC 
1651 TGTTCTCATG ACTCTTCACG TCCCTGGAGT TGGACTCTGG ATGGGGCGCT 
1701 GGGATGCTTG CTTTTGTCTT GTTCAAGTTT CACAGCAAGT ATGTTGACGA 
1751 TTGGAATCGG GGCCAATCAA GAGTCAAGTT CAAAGTGGTA CTCCTGGGCT 
1801 TTCCATCCCA GACTCCAAGT CGAATCTGAG TCTAGAAGAG AGCGGTTTCT 
1851 TGCTCTAACT AGTGAATCTC TGTTCCCAAA CTGGACTTGA CAGAGCTCTC 
1901 CTCACCTATA CTTGGACTGT AGCGGCCATA GGGTTCTCTT GGGGATGGGT 
1951 GGGAGGGTGC TAT GAAC AC A AGACCACAAC AT T C AG AAAG AACCTCGACA 
2001 AT GG AC AG AG TTTATGAAAT TCCTGAGGAG CCAAATGTGG ATCCGGTTTC 
2051 ATCTCTGGAG GAAGATGTCA TCCGTGGAGC CAACCCCCGA TTTACTTTTC 
2101 CATTTAGCAT CCTTTTCTCC ACCTTTTTGT ACTGTGGGGA GGCTGCATCT 
2151 GCTTTGTACA TGGTTAGAAT CT AT C G AAAG AAT AGT G AAA CTTACTGGAT 
2201 GACATACACC TTTTCTTTCT TTATGTTTTC AT C CAT TAT G GTCCAGTTGA 
2251 CCCTCATTTT TGTCCACAGA GAT C TAG C C A AAGATAAACC GCTATCATTA 
2301 TTTATGCATC TAATCCTCTT GGGACCTGTT AT C AG G T GAG CAACTTTTAA 
2351 ATCTTTTCCT TACCCCCCTA ACCCCACCCC AGACTTGGGC AGAGAAAGAT 
2401 GAAAGATTTA CAAGATGGAT ACTATGGCTC TAATCAATTC TCTCATTTCC 
2451 TCCCACTCTC GGCTTCCCTG TCTACCATTC AGAAAACTTA CCTGAAATCT 
2501 TAAATGCCAC CATGATGAAC ATGTGGTATG TACTTGTGTT CCAAAACAAT 
2551 GAACGATGCT ATTTGGGCTG TGTAAACTAG AATGGGAACA ACAAGACGTG 
2601 ATCACCCTGT GCATGAAGGC CATAGCTGCA GAGTGTGTAA TTTTATTTAA 
2651 AAAAATTTTT TTTTCTGAGA CAAGGTCTTG CTCTGCCTCC CAGGCTACAG 
2701 TGCAGTGGTG CGATCATGGC TCACTGCAGC CTTGATCTCC TGGGATCAAG 
2751 CGAACCTCCC ACCTCAGCCT CCAAGTAGCT GGGACCAAAG GAATGTGTCA 
2801 CCATGCCTGG TTAATTAAAA AAAAATTTTT ATAGGCCGGG TGTGGTGGCT 
2851 CATGCCTGTA ATCCCAGCAC TTTGGGAGGC TGAGGCGGGT GGATCACCTG 
2901 AGGTCAGGAG TTCAAGACCA GCTGGCCAAC ATGGTGAAAC CCCTGTCTCT 
2951 ACTAAAAATC AGCTGGGTGT GGTGGCGCAT ATCTGTAATC CCAGCTACTC 
3001 TGGTGGCTGA GGCAGGAGAA TCACTTGAAC CCGGAAGGTA GAGGTTGCAG 
3051 TGAGCCAAGA TCGGTGCCAC TGCACTCCAG CCTGGGCGAT AGAGTGAGAC 
3101 TCCATCTCAA AAAAAAAAAA ATTTTTTTTG T AG AG AC G G G ATCTCGTTAT 
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3151 GTAGACTGGG CTCAAGTGAT CTTCCTGCCT CAGCCTCCCA AAGTGAGCCA 
3201 CCACGCCTGG TCTGAGTGTG TAATTTTGAC TCTACCTTTT TGGATGCTTT 
3251 GTAAATTGGA TAAAAGTTTC TTTACCCTGA GCTGCTTGGG CTGGTGCTAC 
3301 TGCCATTTTC AAATTTTCCA GAGTAATGTG ACATCTGGAA ACTATTTTAA 
3351 ACCATCTGTG GTAATCTGTA CCCCAACCCA ATATAGTTCA GTTCTCTGTC 
3401 GGTTTATCAG TTTCCTATTT ATCTCTTTGT ATATTTCTGC AATAAAGATA 
3451 CGAAGTTGGG AGGGGGCAAA GGAAGGCAGT TCATCTCTCT ATGTGGATGC 
3501 AGTAGCACAA TTTAATAGTA TCAAGTATTT C CAT T C AG AT TGCCTTGAAG 
3551 TGGAAAGAAT GCACTTAATC CTAGCGAGAT AGGCACCTGT GTCAACAGTC 
3601 TCATCTGGAT GCTATGGGGT TTTCAAGGTA GAGAGATGTT GCAAAACTTA 
3651 TGAGTTCAGG AGTAAGGAAT GGACCAAGTT TGTCTTGATT GCGAGAGAGG 
3701 CAGACAACTG CAGTCAGCCG AGGAATATGG GTCAGAGTGT TGCAATGGGA 
3751 AGATACCTCA TCATTAGACA ACTAAAAAGT CTGTGAAACT AATTAAGGAT 
3801 GGAACTCACT CCTTTATAAA AT T T CAT AT C TGTACACATG TATAATTTTT 
3851 ATTTGTCACT TATACCTCAA TAAGGCCAAA AAAATTTTTT ATCAATAAAT 
3901 TTTTAAGTGG GGAGGAATCG ATTAGGCTCT AT C AG AG AG A ATATGGGATA 
3951 TCAATGGAAA CAGTGGCCTG AAATTTGGAG TCTAGTCTTC CGCCTGTCAT 
4001 TGACTGGTTG TGTGTTCTTG GTAAAATCTC TGAAGATGGC TTCACAGGAA 
4051 GGCATATAGA GTTCCCTCAT CTGTAAAGCA AATGGGTTAG TCTAAATCAT 
4101 GGGTCTCAAA CTCAAACACT TGCAGGGACC AGGCAGGTAT CATAAATGAA 
4151 TGAAGCAGGC CTAGTATAAG AAAAAACAGT AGCCTTGTGT GAGATGATAA 
4201 ATGGAAACAA AGTCTCAGAG AAATACTGAG GAGTAGTGAG TACCATGGTA 
4251 ATCTGAAATC TTCATGACCT GCCTGAAGGA GGTAGCCCCT CTAGAGCCCT 
4 301 GGCGCATTGT TTCCATGTTG GAATTCAGAC CCAGTATTGC CAGATCCACT 
4351 AACTTTTCGG GAGATGCTCC CAAGACAGGA TTTTTATATG AAATGTCATG 
4 401 ATTTTAAATT TTCACAGCTG ACTAAAACAA TAACAACAAC AACACAGGAT 
4 451 GGACCAAACC ATATCTGTTG GT CAGAT AT A ACTCAGCTGG CCTATATGCA 
4501 TCTTTGGACT GGGTGATGTA AAGGTCCTTT ACGGTTCTAA ATCTTTGAAG 
4551 TTAAGCTGTA AAAGGAAGAC CTCATCTTGA CCTTGAAACC AAGAAATTTA 
4 601 AAGTTGTGAC TACAGGAGCA AATAAACCAT TCATCCCTCC TTTTTCAAAT 
4 651 ACAATATATT GAGTTAACCA ATCGAAAACT CTCAAGATAC AAATTTCAGA 
4701 AAGTACCCAG CTGCACCCTC CCCTCTTTTT GACTTCCTTT GTTTGCTTTG 
4751 TGAACCCTCT GTGTAGAGTG TTGAGTACTG TTTTTCATTT TTGTTGTTTA 
4 801 GCTTCCACTA GAAATGATTG GGAAGCATTT ATAACCTCAG GCAGCTTAGC 
4851 CCACAGCAGA GAAAAGATAA AAACTCATAA ATT AT ACT CT GGATTCGCTT 
4 901 ATTTTCAAGG CCAATTACTT GTTAGATAGG TAGGAACTTG ATTAGTGTTA 
4 951 TCAGGCACAT GAAGGTGCTT GTAGAGTCTG GGTGCCTTAC ATGAAATGCA 
5001 AGCATACTTC CGAAATGAAA ATGTACTCTA ATTTATTGAA GCTTATAAAT 
5051 GGACAAACAC CCTTACTTAA ACCAGAAAAT AGCCCTGAGA ATAGAAACAG 
5101 AACATTTATG TAAATGTAAA CGGAACATTT CATGCCACCA CCTTCTCCAA 
5151 TACTGTTCTC CAATTTAGCA AT AGT AC T G A TGGGTTGGGG TTAAAATCTA 
5201 AAATTTTTCA TTGAAAATGC ACTTATGCAG AACAAGAATA GGAAAAAAGT 
5251 GTTGCTTTTT CTTCTCTGTT CTTTCTTTGC ATCTTTTTCT TTCCCAGGTC 
5301 TTAGAGTTTG TCCCTAGAAG GTGACAATTT CAAACTACAT GCTTCAGAGT 
5351 GGTACACATG CATCAGTCTT AGGGTGATCT ATGGAGACTG GCAGCCAGCA 
5401 TATGTTCCAA ATTTTCCTAT CAGGAACTAA AGGCTAGAGA GCATATCAAC 
5451 CTCTGGGCTT GTCTTTGGTC TACTTTTCTG TTAAATTTCA TTGCTGTTAT 
5501 TATTATCCTC TCCTCCCATA ATTGCTTACC CTGTATTATT TTCTTCCTTC 
5551 TTATTCTTTC AT T TACT C AG CAAATATTTC TCAAATACCT ACTAAGTGAT 
5601 AAGAGCTGTA AACAAGATAA ATACAACCCT TGACCTCAGT CTCTTGGGCA 
5651 AGACGTGTTA ATGTCCACTA CAAATGTTCT TACTAGTCAT AAGTAGTCCA 
5701 CAGTTTTTAT TCATTAAAGG TGAGTGGCGA AGTGGTAACT CAGGTGTTCC 
5751 AGTAACAAGA ATGTTCTAGT TGCTTCTCTT CCACTTACCA CATCAGAACT 
5801 GCTAAAGACT TCTGATTTGT ATGGGGGAGG TGGGAGGGGC AGAGCAGGAA 
5851 ATGTCATCTT ACCCTTATTC CAAGGATGAT AGGCTTTCAT AAGGATGTTT 
5901 TTCTCTTCGT AAAGAAAGAA TCCAGTTTAA AAGGCTTTTG TCCACAAACA 
5951 GGACAAGAGG CACAAAAAGT AACTATTACA GTGATCTTTC GAGGGCCTAG 
6001 TTATGTAGTT CATTCAGGTT TGAGTTGTCG TCTTTTAAGT ACTTTTGTTG 
6051 CTTTGATGGC TTCCTGTGTA TATGAGATAT TTTTTTTCCT CTGATCTGTC 
6101 CCAAGACTTT TTGGCTGAGA TATGGTTGTG AGCCCTTTCT TGAAAAAGCA 
6151 GAATCTGGCC AGGCGCAGTG GCTCATGCCT GTAATCTCAG CACTTTGGGA 
6201 AGCTGAGGTG GGTGGATCAC CTGAGGTCAG GAGTTCAAGA CCAGCCTGGC 
6251 CAACATGGTG AAAACCCGTC TCTACTAAAA ATACAAAAAA AAAAAAAACC 
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6301 TTAGCCGGAC ATGGTGGCAC ATGCCTGTAA TCCCAGCTAC TCAGGAGGCT 
6351 GAGGCAGGAG AATCGCTTGA ACCCAGGAGG CAGAGGTTAC AGTGAGCTGA 
6401 GATCGCGCCA GTGCACTCCA GCCTGGGCGA CAGAGCAAGA CTCTGTCTCA 
6451 AAAAAAAAAA AAAAAAGAAA GAAAGAAAAA GAAAAAGCAG AATCTAAAAC 
6501 TTTGGTTATG GAGCTGAATG CTTTGAGGGA GGAATGCTTT AC CT C AC G AA 
6551 TTTGAGGTAA GAAAACAGGG CCTTTGGAAC CTTCATTATT TTGCTAGGAA 
6601 AACAGTATCG ACTTAATACC TTTGTGTTCA AGGCACTTTT CTACCTGCCA 
6651 CAGGCCTATT CTTAAAAAGA CAAAACAATT CCTCGAGTCC TCAAACAAGT 
6701 ACTTCTGAAA CAGTGTTCTT AGGTCAGTCG ATGACTGAAC AAAAAT G GAT 
6751 TTAGATTCAT GTAACTTGTA GAAGGCATGA TCCACCCTTT GACTTATGAG 
6801 AAATGATCAG AACAGAAGAG AGAAAAAGAC AAAAAGTAGT GCAGGCTGGC 
6851 CATGGTGTCT CACACGTGTG ATCCCAGCAC TTTAGGATCC CAGCACTTTG 
6901 GGTCAAGGCA GTAGGATTGC TTGAGCCCAG GAGTTTGAGA CCAGTCTGGG 
6951 CAACATGTCT AGATCTCCTC TCTACACAAA TTAAAAATAG CTGGCATGGT 
7 001 GGCATGCGCC TGTAGTCCTA G CT ACT CAGA AGGCTGAGGT GGGAGGATCA 
7 051 TTTGAGCCTA GGAGGTCAAA GCTGCAATGA ATTATGATTG TGCCACTGCA 
7101 CTCCAGCCAG GGTGATGGAG TAAGACCTTG TCTCAAAAAT AAAATAAAGT 
7151 AGCACAACCT CCCCAAGTTA TTTTTTTCCC TCACTACAAC CTCCCTTCCC 
7201 AGGACAGCTT AGTTAAGTTT GCATGATGCT TTACTTCTGC AGATGTTTGG 
7251 AGGCCATGAT TAAGTACCTC ACACTGTGGA AGAAAGAGGA GCAGGAGGAG 
7301 CCCTATGTCA GCCTCACCCG AAAGAAGATG CTAATAGATG GCGAGGAGGT 
7351 GCTGATAGAA TGGGAGGTGG GCCACTCCAT CCGGACCCTG GCTATGCACC 
7401 GCAATGCCTA CAAACGTATG TCACAGATCC AAGCCTTCCT GGGCTCAGTG 
7451 CCCCAGCTGA CCTATCAGCT CTATGTGAGC CTGATCTCTG CAGAGGTTCC 
7501 CCTGGGTAGA GGTGAGTGGG GTCAGGAGAG GGGAGGGCTC CAGTTAAATC 
7551 AAGGGTCTTA GAAGTCTAGA CCCAAGCTGT CTAATAAACT GGCCACTAGC 
7 601 TTCATGTGGC TATTTAACTT AAAATTAAAT AAAATTAAAA ACTTGTTCAT 
7 651 T AAT ACT AG C TACATTTCAA GT T CT C AG C A GCCGTGTGTT GCTAGCAACT 

77 01 ACTGTATTGG ATGGCACAGG TATAAACATT TCCATCATCA CAGAAAGTTC 
7751 TATCGGACAG CACTGGGAGA TAGTTAAATA ACTTGTGGAG TCAGACATCT 

78 01 CAAGCCTGCC AGATTTCTTA AACAGGTAAG CTGTTTAGAC TAAAAATGTC 
7851 ACAGATAAAC CTTCTCTGGG CCCAGAAGAA GCTAGTAATA CCAGCAGTCA 

7 901 GTAGGATATT TTCCCTTGCC CAAAATGTTT AAATTATGCT GTTGTTTTGT 
7951 TTGTTTAAGG ATGGCAGTCT TTAATAAGAG GTTCCCAAAT AGTACTGATC 
8001 ATCAGAATCA TGTGATGAGC TTCTTTTTGA AATTATATTC ACTCCCCAGA 
8051 CTTGAATCAA TCTTAATATG TATTTCTAAA AGGTACCCAG TTGATTTTGA 
8101 TCAGCCACAT TTGGGAACCA ATGATTTAAT CATTTCTGCT AATGCCAGTG 
8151 GAGAGAAAGA AAAGGAGCGT GGGCTGGGCA CGGTGGTTCA AGCCTGTAAT 
8201 CCCAGCACTT TGGGAGGCCA AGGCGGGTGG ATCACAAGGT CAGGAGATTG 
8251 AGACCATCCT GGCTAACATG ATGAAAACCC GTCTGTACTA AAAATACAAA 
8301 AAATTAGCCG GGCGTGGTGG CAGGTGCCTG TAGTCCTAGC TACTCGGGAG 
8351 GCTGAGGCAG GAGAATGGCG TGAACCTGGG AGGCGGAGCT TGCAGTGAGC 
8401 CGAGATCGCG TCACTGCACT CCAGCCTGGG TGACAGAGCA AGACTCCGTC 
8451 TCAAAAAAAA AAAAAAAAAA AAAAAAAAAA GCGTGGGGTT AAT ACT AAT G 
8501 AGAGTCAGGC CTGGACCAAG TTCTGACCTT CACTGTGATC TTTGGAGGAA 
8551 GTTACAAAGC AATCACTGAC CTAATTTCCC ACTTGTAGAA GAGGGATCCT 
8601 GAAATGAGTA AGACCTCTAG CAGAAGATGA AATGTGAGTC AGTGTTTTCA 

8 651 AAGTTGAGAT AAATTGTTGT T AAT G AAT T T TAACAGCCTG AGATTTGCTT 
8701 CATCTGCTTG GGCAGGCACT GGTATAGGTG TGGGTACAGG TTTGGACCAT 
8751 TTCCTATTAG ATTCTAACCC TGTTTGGCAA AGTCCCATGT CTCAAATAAG 
8801 GTAAGGAGAA AATTTGCCCT CTTTTGTCTT TTTTCCCCAC TCAGAATTGT 
8851 TCTTGAAGTT CTGTTGGTCT TGAAGCTTTT CACATACATA GTAGTTTGAG 
8901 GAGAAAACTC TTTGGAAATG ATGATGCTTT TCCTTTAAAT CATCTAATAA 
8951 AAATAGGTGT ACATTACGGC TGGGCATGAT GGCTCACGCC TGTAATCCTA 
9001 ACACTTTGGG AGGCCAAGAC AGGCAGATCA CTTGAAGTTG GGAGTTCAAA 
9051 ACCAGCCTGG CCAACATGTT GAAACCCCAT CTCTACTAAA AACACAAAAA 
9101 AAATCAAGGA TGGGCATGGT GGCTGATACC TGTAATCCCA GCACTTTGGG 
9151 AAGCCGAGGC AGGTGGATCA CCTGAGGTCA GGAGGTTGAG ACCAGCCTGG 
9201 CCTGGCGAAA CTCTGTCTCT ACTAAAAATA CAAAAATTAG CCGGGTATGG 
9251 TGGTGGATGC TTATAATCCC AGCTACTTGG GAGGCTGATG CAT G AG AAT C 
9301 ACTTGAACCT GGGAGCCGAG ATCTCACCGT TGCACTCCAG CCTGGGCAAC 
9351 AGAGCGAGAC TCTGTCTCAA AAAAAAATTC AGCCAGGCGT GGTGGTGGGT 
9401 GCCTGTAATC CCAGCTACTT GGGAGGCTGA AGCAGGAGAA TTGCTTGAAC 
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9451 CTGGGAGGTG GAGGTTGCAG TGAGCTGAGA CTGCACCACT GCACCCCAGC 
9501 CTGGGCGACA GAGGGAGACT CCCGTCTCAT AAATAAATAA ATAAATAACA 
9551 AAAGTAATAC ATGCACAAAA TGACATATAA GTAATTGTAT TTGCACAGAA 
9601 AATTTCTGGA AACTATGCAA GAAACTACCT CTGCGGAGTG GGAATGAAAA 
9651 GTCAGCAGTC TTACTTTTTA AAATTCTTCT GTATGGTTTG AAAATTTTTT 
97 01 TTGTGATCAT GCATTACTAG TTTTGGTCTT TATCTTTTTT T AAT T AC AAA 
9751 AGTCAGACAT GGTTATAGTA AAAATTAAAA ACCATACAGA ATAGATATAA 
9801 AATAGGAAAC GTAATCTCAC TCCCCAAAGA TAACCTCTGT TAATCATCCA 
9851 GTATATATCC TTCTGGACTT ATTTTTACTA TGTAAACATA AACATACATA 
9901 CAATATATAT TGTACATGTT TTTGCCCCAA AATGGACTGT ATGAAACATT 
9951 CTGTCAACAA AGTATTTTTC AAAAGTACAG TATGCCAGTA TGTCTTTTCT 
10001 CAAGTTATTT ATATATACAT GTATAACAAT AATAAATATA TAATATACAT 
10051 TTCCTTTATA TGAATTAGAC TATTTTTATT CTCCTAATTT TCTATTGATA 
10101 GGATTCTATT GATTGTCTCA AAAAGGAAAA AAAAAGGTAG CACAACCTCC 
10151 CCTAGTTATT TTTTCCCCTC ATTACAACCT CCCTTCTCAG GACAGCTTTA 
10201 GTTAAGTTCC CATGATGCTT TACTTCTGCA GATGTTTGGA GGCCATGAGT 
10251 AAGGACTTCA CACTGTGGAA GAAAGAGGGG CAGCAGGAGC CCTATGTCAG 
10301 CCTCACAAAT TTTAATTTTT CACAAAAAAG TTGTTTCCTA ATTGCAAATT 
10351 ATGCCACAGT AAACATCTTT AT AAAT AC C T GTGTACATGA ATGAGACTTT 
104 01 GT AG GAT AAA TTTATAGCAG TAGAATTGCT GGGTGGAAGG ATATGTATGT 
10451 TTAAAATTTT ATTGATATTG CCAAATAACT CTTCCAAAAA GATATATGAA 
10501 TTTATACTCT CACCAACAGT ATACAAATGT GCCTGTTTCT GTTTCTTCAT 
10551 ATCTTAAACT CAATATTCTT TATTTGTATA AT TAT AAAAT AATTGGCTTT 
10601 T AAAAT AAT T GACTTTTAAA ATAATTCGCT TTCTTTGGTT ATGAATGAAG 
10651 CTGAGCATCT TTTTGTGTTT GGTCATTGTG TGTTCTGTGA ATTGCTTGTT 
10701 TATATATTTT ACTCGCTTTT TCTAGTGGGT TGTCTTTTTC ATATTAATTT 
10751 TTAGGAGCTA TTTACTTATT CTTGTTATTA ATCCTTTCTC TGCTGTGAAT 
10801 ATGTATGCAT ATATTTGTAT AATTTTTTGC TTGTACATAC ACACATTTTA 
10851 AAT AT GT AT A TAG AT GT CAT ACGTGTAATA TGTGTGTGAT ATATTTAATA 
10901 T CCACAAT AC ACTTTGTAGT ATCTTCTGGC ATTCTGAAGT ATTACATTTT 
10951 TATGTATTCA AATTCTTTAT TGCTTTTAGA TTTTGTGCCT TTCTTACAAA 
11001 GGCCTATCAC ATCTCTCATC TGGTAGAACA ATTTTCCCCA ATCTTTTAAG 
11051 TAGATTAATT TTCAAGATAA TTTTTTAATT CATCCTACAA AAAACAAAGC 
11101 AAAATAATAA CAGCAAAAGA AAAAAACATT TCATTGAGAT TCCGATTGAG 
11151 ATTTGCATCA AATTACTTAG GTTATTTTGG GAGAATTTAC ATCTTTATAG 
11201 GATTGTTGGA TTTCATATTG T G AAAT GAT A AATCTCTCCA TTTTATTAAA 
11251 TATTTTAACA TGTACCTCAG TAAAATTGTA TAGTTTTCTT CAGTAAAGTT 
11301 GTATAGTTTT CTTGCTATGA GTCTTACATT TTTATAAGGC TTACTTTCAG 
11351 ATGTTCTATC AGTTTTAAAA TGACCTGATT TTCTAAGTAG C AGGAT AG T A 
11401 TCCAGGTAAA GTAAACCCAC CTACCATACT TTTGGAAATA GGGGGATGAT 
11451 GAAGATGACA AAGAATAGGA AGAAAGAGGA GGAGGGGGAG GAGGAGAAAA 
11501 AGGAAAGGAA GAAGGGAAGA AGGAAGAAGA AGAAAGAAGA AC AC AG C T AA 
11551 AAGAAT T T AC TAGGTTCTAG GCATTTCTCT AAGCCCTTTA CAT GT AAAT G 
11601 TTTATTTAAT CTTTACCACA ACCCTATGAG ATAATTATCA TTCTCATTTT 
11651 ACAGACGAAG AAACAGACGC ACTGAGAGTT TAAGTATATT CCCCCAAGGT 
117 01 CCCATAAGCA AAGATTGGAT TGGAATTCAG GGTGTTTGCC TCCAGAGCCT 
11751 GTGTATTTTG TTCTCTTATG GCATGAGTGT ATTTGTAGGG ACACAGATTG 
11801 AAAATGTTTT GACATTTATT GGAAGCATCA GGTTTTTTTC CTTCTGTTAC 
11851 ACT ACT AAT C AAT AAAT GAG TTCTAATGTA AGGGAAAGCA TCGCCACACA 
11901 GCTGGATGTA TGCTCTCACA TTCCCAGTTA CATAAGGTGC ATCAGCTCTT 
11951 GAGGATGGGA CTGAGAATGG TTGAGAAAGA CAAGAGTCAC CACTTCAAGA 
12001 GTCTCCAATT CCAGTCTCTG AGATTCCAAC ACTCTACTTA AAACTGAAGA 
12051 ACTCAGAGCT GTGCTTCCTT TTGGGTTTAC ATGGGGGAAA TCTTAACTTT 
12101 TCCTCCACTA AAAGTAAAAG ATTAAGTTGA CATCTCTATG GCCACCTTTT 
12151 CCCTACATCA AGTGTTTTAA TAGGAACAGA AAACTCCAGC TTTCCTTTTG 
12201 GATGAGTATT CCTCAGCCAT CCCACTTCTC T T GAG AG C AC TGGATTTTTC 
12251 TTAGTAATCA GCATCCTTTG ACATAAAGGA AGAAAAGGAA AGGGCCACCT 
12301 GTGTCATCTA TAGTTGAGGC TTAGGTAGGT TAGGGAGCAT CCTGGCTCTT 
12351 TAGGGCCACT ACTCTAACAT ATGGTTCCAT GGATGTCATG GGTGAGGCAA 
12401 CAGGGTTTGG AAATTTTTGA ACTACTCTGC TGCAAACTCA GAGATTCCTA 
12451 ACATATGGGG GTAATGAATT GACATTGCTG AT G AC AAAT A TAAGCAACTC 
12501 TTGAGTATCT CAGTGAATTG AGAACTGAGG TACATAGATA TTCAGTGACT 
12551 TCCAAAAGTT CCCATACAGC TGAACCAAGG ATTTCTTTCT TTCTTTCTTT 



FIGURE 3, page 4 of 8 



12601 CTTTCTTTCT TTCTTTCTTT CTTTCTTTCT TTCTTTCTTT CTTTCTTTCT 
12 651 TTCTTTTCTT TCTTTCTTTC TTTCTTTCTT TCTTTCTTTC TTTCTTCTTT 
12701 CTTTCTCTTT CTTTCTCTCT TTTTCCTTTT CTTTTTCTTT CTTTTCTTTC 
12751 TCTCTTTCTT TCTCTCTCTC TCTTTCTGTC TTCCTCCCTT CCTCCCTTTC 
12801 TCTCTTTCTC CCTTTCTTTC TTCCCTTCCT CCTTACAGGC ATGCACCACC 
12851 ATGCCCAGCT AATTTTTGTA TTTTTAGTAG AGTACCGGGT TTCACCATGT 
12901 TGGTCAGGCT GGTCTTGAAC TCCTGACCTC AGGTGATCCA CCCACCTCAG 

12 951 CCCCGCAAAG TGCTTGGGAT TACAGGTGTG AGCCACCGTG TGCACGGCTG 
13001 GAACCAAGGA TTTCTAATTA GTTTTATTTT TTATTTTTTT TCTTTTTGAG 
13051 AAGGAGTCTC ACTCTGTCAC CCAGGCTGGA GTGCAGTGGG GCAATCTCAA 
13101 CTCACTGCAA CCTCTGCCTC GTGGGTTCAA GTGATTCTCG TGCCTCAGCC 
13151 TCCTGAGCAG CTGGGATTAC AGGCATGCCA TCATGCCTGG CTAATTTTTT 
13201 TTTCTTTCTT TTTGAGACAG AGTCTCACTC TGTTGCCCAG GCTGGATCGC 
13251 AGTGGTGCAA TCACGGCCCA CTATTACCTC TGCCTCCCAG GTTCAAGTAA 
13301 TTCTCCTGCC TCAGCATCCC AGGTAGCTGG GAATACAGGT GCACGCCACC 
13351 ACGCCTGACT AATATTTGTA TTTTTAGCGG AGATGGGGTT TCATCATGTT 
134 01 GGCCAGGCTG GTCTCGAACT CCTGACTTCA GGTGATCCAT CCGCCTTGGC 
134 51 CTCCCAATGT GCTGGGATTA CAGGCATGAG TCACCGCGCC CAGCCTAACT 
13501 AGGTATTTTA TGCACCTCTC CTAATCTCAG AAGTCTTCAT TAATTCCACA 
13551 AACATTTATT GAGCACCTGC TATGTTCCAG GTAATATGTT AGGCTATGGG 

13 601 AATACAGCAG TGAAGAAAAC ATGGTCCCTC CTGCCTTCAT GGAATTTTCA 

13 651 AT AC AC AT T T T G AC AC AT C A CTGAAGCTAA GTGTTCTAGA AACACACAAA 
13701 CAATGTTAGT TCCTTGAACA AGATATACAT CAAAGAAGGG ACTTCTATTA 
13751 GCAAGAGCGT TCTCTATGAG TCTCCTAAGA CTGGATTTTT TCAGATAGAG 
13801 TTCTTTCCGC CTTATTCAAT GTTTGCTCCG AAGCCTGCTT CAT CAGCAAA 
13851 GTCTGCCTGA TACCTTTATA TGTACTCTTC TCACGTTAGT GACTTCTCAA 
13901 TGTTCTAAGA CCCATGCTTT TTAAGGAAGT TTATTTTGTA TATTTATATG 
13951 ATTATTAAAG TGTTACAGTA TAT GTT CATC ATGAGAAATT TAGAAAATAG 
14001 AGAAATGTAG AGAAAAAGAT TTCTAAAACT GATATAAGAC TATCACACAC 
14051 AAAAAAAGAT ATTTTGGTTC ATTTTTTCAA TTTTTTGTGC ATCTATTTTG 
14101 TTTTATTGTA TATATTCAAG GTGTACAATG TGATGTTTCG ATGTATGTAC 
14151 ACATTGTGAA ATGATTACCA CAACCAAACT AATTAACACA TTCATCACCT 
14201 CACATAGTTA TCATTTTTGT ACGTGTGTGT GTGTGTGTGT GTGTGTGTGT 
14251 GGTAAAACTT AAGATCTACT CTCTTTAAAA ATTTCAAGTA CACAATACAT 
14301 TATTGTCAAC TATAGTCATC ATGTTGTACA TTAGAGCTCT GAAACTTATT 
14351 TATCTTATAA CTCTAAATTT GTAGCCTTTG ATCAAAATCC TTCTATTTCC 
14401 CTAAATCCCC ATCCCCTGGT AACCACCCAT TCTACTCTGT TGCTAGGTGT 

14 451 TCAACTTTTT TAGATTCCAC ATATAAGTAA GACAATGCAG TATTTTTCTT 
14501 TATGTGTCTA GCTCATTTCA CTTAGCATAA TGTCCTCTAG GTTCATCTGT 
14551 GTTGTCACGG ATGGCAGAGT TTCTGTAATT TTATGGTTGA ATAATATTCA 
14 601 TACACACACA CACACACACG CACACACACA CACACACACA CAGACACACC 
14 651 C AC C AG AT T T TCTTTATCCA TTCATCTGTC AAC AG AT AC T GAGTTTGTTT 
14701 CCATATCTTG GCTATTGAGA ATAATACTAC AATGAGCATG AGAGTGCAGA 
14751 TATCTCTTTG AGATACTGAT TTCCTTTAGG TATACACCCA GCAGTGGGAT 
14 801 TATTTGATCG TTTGGCCGTT CTGTTTGTAA TTTTTTTGGA GAACCTCCAT 
14851 GCTGTTTTCC ATAATGGCTG TGTCAGTTTA TGTTCCCACA AACAGTGTAC 
14 901 AAGGTCCTTT TTCTTACATC CCCACCAACA CTTTTTTTTT TTAATAATAG 

14 951 CCATTCTAAC AGGTGTGAGG TGATATCTCA TTGTGGCTTT GATTTGCATT 
15001 TTTGTGATGA TTAGTGATGT TGAACACCTT TTCATATACC TGTTGGCCGT 
15051 TTGTATACCG CCTTCGGAGA AAGTCTATTC AAGTGCATGC TATTTGTTTA 
15101 CATAGCTGTG AT CAT AT T T G CATTTGCTCT TAACTGGAGC TCTCAAGTCT 
15151 CACCCGTCAT CTCTCTGGAC CTCTGGGTTA TAAGTACAGC CTTCATTACC 
15201 AACATTGACT GATTGCCTGT TTTTGTTTTT GTTTTTGTTT TTAACAGTTG 
15251 TGCTAATGGT ATTTTCCCTG GTATCTGTCA CCTATGGGGC CACCCTTTGC 
15301 AATATGTTGG CTATCCAGAT CAAGTACGAT GACTACAAGA TTCGCCTTGG 
15351 GCCACTAGAA GTCCTCTGCA TCACCATCTG GCGGACATTG GAGATCACTT 
15401 CCCGCCTCCT GATTCTGGTG CTCTTCTCAG CCACTTTGAA ATTGAAGGCT 
15451 GTGCCCTTCC TAGTGCTCAA CTTCCTGATC ATCCTCTTTG AGCCCTGGAT 
15501 TAAGTTCTGG AGAAGTGGTG CCCAGATGCC CAATAACATT GAGAAAAACT 
15551 TCAGCCGGGT CGGCACTCTG GTGGTCCTGA TTTCAGTCAC CATCCTCTAT 
15601 GCTGGCATCA ACTTCTCTTG CTGGTCAGCT TTGCAGTTGA GGTTGGCAGA 

15 651 CAGAGATCTC GTCGACAAAG GGCAGAACTG G GG AC AT AT G GGCCTGCACT 
15701 ATAGTGTGAG GTTGGTAGAG AATGTGATCA TGGTCTTGGT TTTTAAGTTC 
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15751 TTTGGAGTGA AAGTGTTACT GAATTACTGT CATTCCTTGA TTGCCTTGCA 
15801 GCTCATTATT GCTTATCTGA TTTCCATTGG CTTCATGCTC CTTTTCTTCC 
15851 AGTACTTGCA TCCATTGCGC TCACTCTTCA CCCATAATGT AGTAGACTAC 
15901 CTCCATTGTG TCTGCTGTCA CCAGCACCCT CGGACCAGGG TTGAGAACTC 
15951 AGAGCCACCC TTTGAGACTG AAGCAAGGCA AAGTGTTGTC TGATTCTATT 
16001 TTCTGGGTAT TTTAGGAAGA GTTGGGAGTT GCCAAGAGTA ACCATGAAAT 
16051 TGAACGAAAG GATGAGGTTC ATGGGTGAGA TACCCATCAG TACATTTTCT 
16101 TGACTTTTCT GTTAAGCCTA T C AG AAG AAA GAGCAACTCC CAAATAGGTT 
16151 TTATTTTCTT AAGAGTTACC ACTATGTTTG GAAACAGGGG GTATCGACTA 
16201 TATAGTTGAA AGGGTCAGAA ATACCATTCA CACCCTTCTT ACCCAAGTCA 
16251 ATTGGAATAA CTTGTCTTCA AACACTTTAG GCTCTCTAAA GTGACCTTCT 
16301 AGCTCTGCTC ATTTGCTTGA TGCATTTCTG AGCTTTCCTG GGCTGAGCTG 
16351 AAGGCCCAGA ATCCCGCTAG AATATATCCT GACTGATCAG AGGATATGAC 
164 01 AGCTTACCAG CTAAGAGTAC CTCCCAGGAA ACAGTCTGAC TAATGTGGAA 
164 51 CCTGCAACTG TCAGTGTGGC TGGGGTCTTT TTAATTCCAG TGAGAAGCTC 
16501 TGGCTGAGAA GAAAATCACC ACTATTAAAA AAGCTGCTCC CCAAGCAGAT 
16551 TAGCTCTCTG TTAGGATTTT ACTAGTGGCC ATTCAGCAAG GACCTCTCTT 
16601 TACAGTGGCA CTTCATAGGC AC ACT CT AAG GAGAAAGTGC AGAGTAGAAT 
16651 TCCTTCAGGG CATAAGCCAA AATGACTCTT TTTCTCAGGG ACCTGCATGG 
16701 GCCTCCAGCT TGTCTATTGG AATTGTTAAG TGAAGCCTCT CACTTAGTGC 
16751 CT CAT TAG C A GAGATTTCCT CCAACCCAGC TTTTCTGTGC TCTTGGTATT 
16801 TTACTACTTG ATGTGGACCT CAGAGAAGCT GAACTGTAAT TGAAAATGTT 
16851 TCCGATGTGT GGAAGAAATG AAGACTGCTT TGTGTCTGCT GTTGTCCTGA 
16901 GTATTTCATT AATGTGTGTG TGTGTGTGTG TGTGTGTGTG TGTGTATGTG 
16951 TATGTGTGTA GGGAAGAAAG TAATAATGGC TGAGACATCA CCTTCATGTT 
17001 GTTTGCGATT GGGATGGGTG ACTAACACTC CAAGGTAGAG TGAAGGCAGA 
17051 GGAGGGAAAC AAGATCACAT T AAAT CAT C A TCAGTACTGG TTTCTGCCTA 
17101 CAGGAGTTTA CTTTTTTTTT TTTTCCTTTT T T GAG AT G G A GTCTCGCTCT 
17151 GTTTTCTAGG CTGAAGTGCA GTGGTGTGAT CTTGGCTCAC TGCAGCCTCT 
17201 GCCTCCTGGG TTCAAGCAGN NNNNNNNNNN NNNNNNNNNN NNNNAGTGAT 
17251 CCACCCGCCT CGGTCTCCCA AAGCACTGGG AT T AC AG G C A TGAGCCACCT 
17301 CACGCGGCCA GGATTTTACT TTATAACAAG GAACATATGT TTATCAACCC 
17351 TCTGTTCGTT CCTATACCCC CAGTGGACGA ATGCATGTCT CCTTTTCTCC 
17 401 T AT AT CT C AA TGTTTACATC T CAT AT C AGT TGGGTATTTT G AT AGGAAT G 
17451 TCAGCCAGCT ACCTCTGAGG TAACCAAGGG ATTGAAGTTA CTATGGCCAC 
17501 TGCCTATTGG GACCAAATAT CCCAGCATTT ACCTAACTAA TGCTTGCCCC 
17551 T C AC AG AC C A GGAAAATTAA AAGAACTCCT AGTCGTGGCC ACCACAACAC 
17 601 TTCAAGAAAT TGTGAACAAT CTGACCTAGG GCTTCCTGTC CTCATCCAAT 
17651 TTTACTCTTG GTAGCATGCT AAGAATTTAT CTTTAGTCAT TTCCTCTCCT 
17701 CTTATCCAAT GTCAGGACAT TATGTTGAGG GAGTTCTCTC TTCTAAGTAG 
17751 CAGGGCTGTT AACCAAAGTA TCTTATTTCT TGGCATGGCT AGCATGGTTT 
17801 TCCCTTCATC AGCCACTGTT TGGGACTAAA AGGATTATAT ACTTAATTTG 
17851 GGAGAGACTG TATGGACTTG CTTTGGAACA GTGGAGAGCT CCTTTCTTCA 
17 901 ACCCCAACTC CCCCATTCCA TTTTTCATGA T G AAG AG AC T TAGTTATTGT 
17951 CATATAAAGC TCACCTGCTG TCTTCTAACT ATGTTATTCA AGG 
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CTTTTAAATCTTTTCCTTACCCCCCTAACCCCACCCCAGACTTGGGCAGAGAAAGATGAA 
AGATTTACAAGATGGATACTATGGCTCTAATCAATTCTCTCATTTCCTCCCACTCTCGGC 
TTCCCTGT CT AC CAT T C AG AAAACT T AC C T G AAAT CT T AAAT G C C AC CAT GAT G AAC AT G 
TGGTATGTACTTGTGTTCCAAAACAATGAACGATGCTATTTGGGCTGTGTAAACTAGAAT 

[G,C] 

GGAACAACAAGACGTGATCACCCTGTGCATGAAGGCCATAGCTGCAGAGTGTGTAATTTT 
ATTTAAAAAAATTTTTTTTTCTGAGACAAGGTCTTGCTCTGCCTCCCAGGCTACAGTGCA 
GTGGTGCGATCATGGCTCACTGCAGCCTTGATCTCCTGGGATCAAGCGAACCTCCCACCT 
CAGCCTCCAAGTAGCT GGGAC CAAAGGAATGTGTCACCATGCCTGGTT AAT TAAAAAAAA 
ATTTTTATAGGCCGGGTGTGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGAG 



2655 TTTCCTTACCCCCCTAACCCCACCCCAGACTTGGGCAGAGAAAGATGAAAGATTTACAAG 
ATGGATACTATGGCTCTAATCAATTCTCTCATTTCCTCCCACTCTCGGCTTCCCTGTCTA 
CCATTCAGAAAACTTACCTGAAATCTTAAATGCCACCATGATGAACATGTGGTATGTACT 
TGTGTTCCAAAACAATGAACGATGCTATTTGGGCTGTGTAAACTAGAATGGGAACAACAA 
GACGTGATCACCCTGTGCATGAAGGCCATAGCTGCAGAGTGTGTAATTTTATTTAAAAAA 

[A, T] 

TTTTTTTTTCTGAGACAAGGTCTTGCTCTGCCTCCCAGGCTACAGTGCAGTGGTGCGATC 
ATGGCTCACTGCAGCCTTGATCTCCTGGGATCAAGCGAACCTCCCACCTCAGCCTCCAAG 
TAGCTGGGACCAAAGGAATGTGTCACCATGCCTGGTTAATTAAAAAAAAATTTTTATAGG 
CCGGGTGTGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGTGGATC 
ACCTGAGGTCAGGAGTTCAAGACCAGCTGGCCAACATGGTGAAACCCCTGTCTCTACTAA 



3693 TCTCTGTCGGTTTATCAGTTTCCTATTTATCTCTTTGTATATTTCTGCAATAAAGATACG 
AAGTTGGGAGGGGGCAAAGGAAGGCAGTTCATCTCTCTATGTGGATGCAGTAGCACAATT 
TAATAGTATCAAGTATTTCCATTCAGATTGCCTTGAAGTGGAAAGAATGCACTTAATCCT 
AGCGAGATAGGCACCTGTGTCAACAGTCTCATCTGGATGCTATGGGGTTTTCAAGGTAGA 
GAGATGTTGCAAAACTTATGAGTTCAGGAGTAAGGAATGGACCAAGTTTGTCTTGATTGC 
[G,A] 

AGAGAGGCAGACAACTGCAGTCAGCCGAGGAATATGGGTCAGAGTGTTGCAATGGGAAGA 
TACCTCATCATTAGACAACTAAAAAGTCTGTGAAACTAATTAAGGATGGAACTCACTCCT 
TTATAAAATTTCATATCTGTACACATGTATAATTTTTATTTGTCACTTATACCTCAATAA 
GGCCAAAAAAATTTTTTATCAATAAATTTTTAAGTGGGGAGGAATCGATTAGGCTCTATC 
AGAGAGAATATGGGATATCAATGGAAACAGTGGCCTGAAATTTGGAGTCTAGTCTTCCGC 



3992 C G AG AG AGG C AG AC AACT G CAGT CAGC C GAG G AAT AT GGG T C AG AG T GT T G C AAT G GG AA 

GATACCTCATCATTAGACAACTAAAAAGTCTGTGAAACTAATTAAGGATGGAACTCACTC 
CTTTATAAAATTTCATATCTGTACACATGTATAATTTTTATTTGTCACTTATACCTCAAT 
AAGGCCAAAAAAATTTTTTATCAATAAATTTTTAAGTGGGGAGGAATCGATTAGGCTCTA 



FIGURE 3, page 7 of 8 



TCAGAGAGAATATGGGATATCAATGGAAACAGTGGCCTGAAATTTGGAGTCTAGTCTTCC 
[G,C] 

CCTGTCATTGACTGGTTGTGTGTTCTTGGTAAAATCTCTGAAGATGGCTTCACAGGAAGG 
CATATAGAGTTCCCTCATCTGTAAAGCAAATGGGTTAGTCTAAATCATGGGTCTCAAACT 
CAAACACTTGCAGGGACCAGGCAGGTATCATAAATGAATGAAGCAGGCCTAGTATAAGAA 
AAAACAGTAGCCT T GTGTGAGATGATAAATGGAAACAAAGT CT CAGAGAAATACT G AGGA 
GTAGTGAGTACCATGGTAATCTGAAATCTTCATGACCTGCCTGAAGGAGGTAGCCCCTCT 

6285 TCTTTCGAGGGCCTAGTTATGTAGTTCATTCAGGTTTGAGTTGTCGTCTTTTAAGTACTT 
TTGTTGCTTTGATGGCTTCCTGTGTATATGAGATATTTTTTTTCCTCTGATCTGTCCCAA 
GACTTTTTGGCTGAGATATGGTTGTGAGCCCTTTCTTGAAAAAGCAGAATCTGGCCAGGC 
GCAGTGGCTCATGCCTGTAATCTCAGCACTTTGGGAAGCTGAGGTGGGTGGATCACCTGA 
GGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAAACCCGTCTCTACTAAAAATAC 

E",A] 

AAAAAAAAAAAAACCTTAGCCGGACATGGTGGCACATGCCTGTAATCCCAGCTACTCAGG 
AGGCTGAGGCAGGAGAATCGCTTGAACCCAGGAGGCAGAGGTTACAGTGAGCTGAGATCG 
CGCCAGTGCACTCCAGCCTGGGCGACAGAGCAAGACTCTGTCTCAAAAAAAAAAAAAAAA 
AG AAAG AAAG AAAAAG AAAAAG C AGAAT C T AAAACT TT GGT T AT G GAGCT GAAT GC T T T G 
AGGGAGGAATGCTTTACCTCACGAATTTGAGGTAAGAAAACAGGGCCTTTGGAACCTTCA 

7066 TT GTAGAAGGCAT GATC CACCCTT TGACT T ATGAGAAAT GAT CAGAAC AGAAGAGAGAAA 

AAGACAAAAAGTAGTGCAGGCTGGCCATGGTGTCTCACACGTGTGATCCCAGCACTTTAG 
GATCCCAGCACTTTGGGTCAAGGCAGTAGGATTGCTTGAGCCCAGGAGTTTGAGACCAGT 
CTGGGCAACATGTCTAGATCTCCTCTCTACACAAATTAAAAATAGCTGGCATGGTGGCAT 
GCGCCTGTAGTCCTAGCTACTCAGAAGGCTGAGGTGGGAGGATCATTTGAGCCTAGGAGG 

[A,T] 

CAAAGCTGCAATGAATTATGATTGTGCCACTGCACTCCAGCCAGGGTGATGGAGTAAGAC 
CTTGTCTCAAAAATAAAATAAAGTAGCACAACCTCCCCAAGTTATTTTTTTCCCTCACTA 
CAACCTCCCTTCCCAGGACAGCTTAGTTAAGTTTGCATGATGCTTTACTTCTGCAGATGT 
TTGGAGGCCATGATTAAGTACCTCACACTGTGGAAGAAAGAGGAGCAGGAGGAGCCCTAT 
GT C AG C C T C AC C CGAAAG AAG AT G CT AAT AGAT G G C GAGGAG GT G CT GAT AG AAT GG GAG 

14223 AAGGAAGTTTATTTTGTATATTTATATGATTATTAAAGTGTTACAGTATATGTT CATCAT 

GAG AAATTT AGAAAATAGAGAAAT GT AGAGAAAAAGAT T T CT AAAACTG AT AT AAGACT A 
TCACACACAAAAAAAGATATTTTGGTTCATTTTTTCAATTTTTTGTGCATCTATTTTGTT 
TTATTGTATATATTCAAGGTGTACAATGTGATGTTTCGATGTATGTACACATTGTGAAAT 
GAT T ACC AC AAC CAAAC T AAT T AAC ACAT T CAT CAC C T C AC AT AG T TAT CAT T T T T GT AC 
[ — T G] 

TGTGTGTGTGTGTGTGTGTGTGTGTGTGGTAAAACTTAAGATCTACTCTCTTTAAAAATT 
TCAAGTACACAATACATTATTGTCAACTATAGTCATCATGTTGTACATTAGAGCTCTGAA 
ACTTATTTATCTTATAACTCTAAATTTGTAGCCTTTGATCAAAATCCTTCTATTTCCCTA 
AATCCCCATCCCCTGGTAACCACCCATTCTACTCTGTTGCTAGGTGTTCAACTTTTTTAG 
ATTCCACATATAAGTAAGACAATGCAGTATTTTTCTTTATGTGTCTAGCTCATTTCACTT 

16915 ATAGGCACACTCTAAGGAGAAAGTGCAGAGTAGAATTCCTTCAGGGCATAAGCCAAAATG 
ACTCTTTTTCTCAGGGACCTGCATGGGCCTCCAGCTTGTCTATTGGAATTGTTAAGTGAA 
GCCTCTCACTTAGTGCCTCATTAGCAGAGATTTCCTCCAACCCAGCTTTTCTGTGCTCTT 
GGTATTTTACTACTTGATGTGGACCTCAGAGAAGCTGAACTGTAATTGAAAATGTTTCCG 
ATGTGTGGAAGAAATGAAGACTGCTTTGTGTCTGCTGTTGTCCTGAGTATTTCATTAATG 
[-,G,T] 

GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATGTGTATGTGTGTAGGGAAGAAAGTAATA 
ATGGCTGAGACATCACCTTCATGTTGTTTGCGATTGGGATGGGTGACTAACACTCCAAGG 
T AGAGT GAAGGCAGAGGAGGGAAACAAGAT C AC ATTAAAT CAT CAT C AGT ACT GGT TT CT 
GCCTACAGGAGTTTACTTTTTTTTTTTTTCCTTTTTTGAGATGGAGTCTCGCTCTGTTTT 
CTAGGCTGAAGTGCAGTGGTGTGATCTTGGCTCACTGCAGCCTCTGCCTCCTGGGTTCAA 
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